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AUTOMATED INFORMATION PROCESSING IN RANDOMLY ORDERED ARRAYS 



FIELD OF THE INVENTION 

The invention re.ates to the use of a computer system to compare images generated from a randomly 
ordered array. This system preserves the relative position of each site within the array so that the 
same site can be compared in different images. 

BACKGROUND OF THE INVENTION 

There are a number of assays and sensors for the detection of the presence and/or concentration of 
specific substances in fluids and gases. Many of these rely on specific ..gand/antiligand reactions as 
the mechanism of detection. That is, pairs of substances (i.e. the binding pairs or ligand/anti.igands) 
are known to bind to each other. whi.e binding little or not at all to other substances. This has been 
the focus of a number of techniques that utilize these binding pairs for the detection of the complexes. 
These generally are done by labeling one component of the complex in some way, so as to make the 
entire complex detectable, using, for example, radioisotopes, fluorescent and other optically active 
molecules, enzymes, etc. 

Of particular use in these sensors are detection mechanisms utilizing luminescence or fluorescence 
Recentiy. the use of optica, fibers and optica, fiber strands in combination with light absorbing dyes for 
chemical analytical determinations has undergone rapid development, particularly within the last 
decade. The use of optica, fibers for such purposes and techniques is described by Mi.anov.ch et al., 
••Novel Optical Fiber Techniques For Medical Application". Proceedings of the SPIE 28th Annual 
international Technical Symposium On Optics and Electro-Optics. Volume 494, 1980, Se,tz, W.R., 
"Chemical Sensors Based On Immobilized Indicators and Fiber Optics' in C.R.C. Critics, Review in 
Anaiytica, Chemistry. Vol. 19. 1988. pp. 135-173; Wolfbeis, O.S., Tiber Optica. Fluorosensors In 
Analytical Chemistry" in MCecuiar Luminescence Spectroscopy. Methods and Applications (S. G. 
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Seaman, editor), W,ley * Son, New York (1988); Ange.. S.M.. Spectroscopy 2 <*» (1987); Wa, 
et a. "Chemical Sensors and Microinstruments, ACS Symposium Series, Vol. 403 1989, p. 252, 
and Wolfbeis, O.S.. Fiber Optic Cnemica, Sensors, Ed. CRC Press. Boca. Raton, FL. 1991. 2nd 

Volume. 

When using an optical fiber in an ,„ vftt*. wVo sensor, one or more light absorbing dyes are located 
near its distal end. Typically, light from an appropriate source is used to Hluminate the dyes through 
the fiber's proximal end. The light propagates along the length of the optica, fiber; and a port,on of th,s 
propagated light exits the dista. end and is absorbed by the dyes. The light absorbing dye may or may 
not be immobilized; may or may not be directly attached to the optica, fiber itself; may or may not be 
suspended in a fluid sample containing one or more ana.ytes of interest; and may or may not be 
retainable for subsequent use in a second optical determination. 

Once the light has been absorbed by the dye. some light of varying wavelength and intensity returns^ 

and measured. The interactions between the .ight conveyed by the opto, fiber and the proper.es of 
the .ight absorbing dye provide an optica, basis for both quaHtative and quantitative detections. 

Of the many different dasses of light absorbing dyes which conventionally are employed with bundles 
of fiber strands and optica, fibers for different ana.ytica. purposes are those more common • 
compositions that emit .ight after absorption termed "fiuorophores" and those wh.chabsobUght and 
internal convert the absorbed .ight to heat, rather than emit it as light, termed "chromophores. 

Ruorescence is a physical phenomenon based upon the abi-ity of some molecules 
(photons) at specified wavelengths and then em. .ight of a longer wavelength and at a .ower e e y. 
LJesabletofiuoresceshareanumberof common charades: the abifity to a sorb .,gh 
energy at one wave.ength A.; reach an excited energy state; and subsequent, emit l,g t at another 
.ight wave.ength. The absorption and fluorescence emission spectra are individual for each 
lophoreandatoften graphic,, v^.t*^™""™"™ 
The samefluorescenceemission spectrum is genera., 

exciting .ight and. according,, the waveband energy of theexciting light ^^"1, 

the strength of the fluorescence sign, may be measured as the quantum y,,d of 
fluorescence quantumyield is the ratio of the number of photons emitted in ^Pans-tothen m 
of photons initial, absorbed by the fluorophore. For more detailed information regar^ng each of these 
characteristics, the following references are recommended; Lakowicz, J. R.. Pnncptesof 

k I A/ Vrtrlf 1Q oq. Freifelder. D., Physical Biochemistry, 
Fluorescence Spectroscopy, Plenum Press, New York, 1983. Freireiaer, 
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second edition. W. H. Freeman and Company, New York. 1982; "Molecular Luminescence 
Spectroscopy Methods and Applications: Part P (S.G. Schu.man. editor) in Chemica, Ana,ys,s. vol. 77, 
Wley & Sons, Inc.. 1985; The Theory of Luminescence, Stepanov and Gribkovskii. Iliffe Books, Ltd.. 
London, 1968. 

,n comparison, substances which absorb light and do not fluoresce usually convert the light into heat 
or kinetic energy. The ability to intemaily convert the absorbed light identifies the dye as a 
"chromophore." Dyes which absorb light energy as chromophobes do so at individua. wavelengths of 
energy and are characterized by a distinctive molar absorption coefficient at that wavelength. 
Chemical anahysis employing fiber optic strands and absorption spectroscopy using vis.ble and 
ultraviolet .ight wavelengths in combination with the absorption coefficient allow for the determinate 
of concentration for specific ana.yses of interest by spectral measurement. The most common use of 
absorbance measurement v« optica, fibers is to determine concentration which is calculated ,n 
accordance with Beers" law; accordingly, at a single absorbance wave.engtl.the greater the quantity 
of the composition which absorbs light energy at a grven wave.ength. .he greater the optica, densrty for 
the sample. In this way. the total quantity of light absorbed direct.y correiates with the quantity of the 
composition in the sample. 

Many of the recent improvements emptoying optica, fiber sensors in both qua.itative and quantitative 
ana.yt.cal determinations concern the desirabi.ity of depositing and/or immobi.ttng various hght 
absorbing dyes at the dista. end of the optica, fiber. In this manner, a variety of different optica, fiber 
chemical sensors and methods have been reported for specific analytic*, determinations and 
appncations such as pH measurement, oxygen detection, and carbon, dioxide analyses. These 
developments are exempted by the fo.low.ng pubiications: Freeman, et a... Ana, Chen,. 5*98 1983). 
Lippitsch eta... Ana,. Chem. Acta. 205:1. (1988); Wolfbeis et a... Ana,. Che.. 60:2028 (1988); Jordan, 
et a... Ana,. Chem. 59:437 (1987); Lubbers et a... Sens. Actuators 1983; MunkhCm et a... Ta anfa 
35109 (1988); Munkho.m et a... Ana,. Chem. 58:1427 (1986); Seitz. W. R., Ana,. Chem. 56:16A-34A 
(1984) Peterson, et a... Ana,. Chem. 52:864 (1980): Saari. et a, Ana,. Chem. 54:821 (1982); 
a,.. Ana,. Chem. 55:667 (1983); Zhujun et al.. Ana,. Chem. Acta. 160:47 (1984); Schwab, et a,.. Ana . 
Chem 56:2199 (1984); Wolfbeis. O.S., Tiber Optic Chemical Sensors". Ed. CRC Press, Boca Raton. 
FL. 1991. 2nd Volume; and Pantano, P.. Wa.t. D.R.. Ana,. Chem.. 481A-487A. Vol 67. (1995). 

More recent*, fiber optic sensors have been constructed that permit the use of multip.e dyes with a 
sing.e. d,screte fiber optic bund.e. U.S. Pat. No, 5.244.636 and 5.250.264 to Wa.t. et a, disdose 
systems for affixing mu.ti P .e. different dyes on the dista. end of the bun*, the teachings of each of 
these patents being incorporated herein by this reference. The disclosed configurations enab.e 
separate optica, fibers of the bundle to opticaHy access indrvidua. dyes. This avoids the problem of 
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deconvolving the separate signals in the returning light from each dye, which arises when the signals 
from two or more dyes are combined, each dye being sensitive to a different analyte, and there is 
significant overlap in the dyes' emission spectra. 

U.S.S.N.s 08/818,199 and 09/151,877 describe array compositions that utilize microspheres or beads 
on a surface of a substrate, for example on a terminal end of a fiber optic bundle, with each individual 
fiber comprising a bead containing an optical signature. Since the beads go down randomly, a unique 
optical signature is needed to "decode" the array; i.e. after the array is made, a correlation of the 
location of an individual site on the array with the bead or bioactive agent at that particular site can be 
made. This means that the beads may be randomly distributed on the array, a fast and inexpensive 
process as compared to either the in situ synthesis or spotting techniques of the prior art. Once the 
array is loaded with the beads, the array can be decoded, or can be used, with full or partial decoding 
occuring after testing, as is more fully outlined below. 

The use of fiducials for the registration of sequential images has been u$ed in screen printing (U.S. 
Patent No. 5,129,155) and in implants in the human body (U.S. Patent No. 4,991,579) and in various 
image processing (see U.S. Patent Nos. 5,245,676 and 5,129,014). 

Accordingly, it is an object of the present invention to provide biosensors comprising random arrays, 
generally comprising beads distributed at discrete sites on the surface of a substrate, that utilize 
computer systems and fiducials to allow comparison of sequential data images of the arrays. 



SUMMARY OF THE INVENTION 



In accordance with the objects outlined above, the present invention provides array compositions 
comprising a substrate with a surface comprising discrete sites, at least one fiducial, and a population 
of microspheres comprising at leasta first and a second subpopulation. Each subpopulation 
comprises a bioactive agent, and the microspheres are distributed on said surface. Each 
subpopulation may optionally comprise a unique optical signature, an identifier binding ligand that will 
bind a decoder binding ligand such that the identification of the bioactive agent can be elucidated, or 
both. 

In an additional aspect, the invention provides compositions comprising a computer readable memory 
to direct a computer to function in a specified manner. The computer readable memory comprises an 
acquisition module for receiving a data image of a random array comprising a plurality of discrete 
sites, a registration module for registering a data image, and a comparison module for comparing 
registered data images. Each module comprises computer code for carrying out its function. The 
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registration module may utilize any number of fiducials, including a fiducial fiber when the substrate 
comprises a fiber optic bundle, a fiducial microsphere, or a fiducial template generated from the 
random array. 

In a further aspect, the invention provides methods of making the array compositions of the invention 
comprising forming a surface comprising individual sites on a substrate, distributing microspheres on 
the surface such that the individual sites contain microspheres, and incorporating at least one fiducial 
onto the surface. When the array has complete rotational freedom, at least two fiducials are preferred 
in the array to allow for correction of rotation. 

In an additional aspect, the invention provides methods for comparing separate data images of a 
random array. The methods comprise using a computer system to register a first data image of the 
random array to produce a registered first data image, using the computer system to register a second 
data image of the random array to produce a registered second data image.jnd comparing the first 
and the second registered data images to determine any differences between them. 

In a further aspect, the invention provides methods of decoding a random array composition 
comprising providing a random array composition as outlined herein. A first plurality of decoding 
binding ligands is added to the array composition and a first data image is created. A fiducial is used 
to generate a first registered data image. A second plurality of decoding binding ligands is added to 
the array composition and a second data image is created. The fiducial is used to generate a second 
registered data image. A computer system is used to compare the first and the second registered 
data image to identify the location of at least two bioactive agents. 

In an additional aspect, the invention provides methods of determining the presence of a target analyte 
in a sample. The methods comprise acquiring a first data image of a random array composition, and 
registering the first data.image to create a registered first data image. The sample is then added to 
the random array and a second data image is acquired from the array. The second data image is 
registered to create a registered second data image. Then the first and the second registered data 
images are compared to determine the presence or absence of the target analyte. Optionally, the data 
acquisition may be at different wavelengths. 

FIGURES 



Figure 1 illustrates a fiber optic bundle with fiducial fibers. 
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Figure 2 illustrates the components of a multi-multi fiber including fiducial markers, optical fiber 
bundles (multi fiber) and the components of a single optical fiber (mono fiber). 

DETAILED DESCRIPTION OF THE INVENTION 

The present invent is directed to the use of randomly ordered arrays comprising a bead-based 
analytic chemistiy system in which beads, a.so termed microspheres, carrying different chemical 
functionalities are distributed on a substrate comprising a patterned surface of discrete sites that can 
bind the individua. microspheres. The beads are generally put onto the substrate randomly, 
bead goes down arbitrarily or indescriminate* on to a site. This allows the synthesis of the candle 
agents (i e. compounds such as nucleic acids and antibodies) to be divorced from their placement on 
an array i.e. the candidate agents may be synthesized on the beads, or on a different substrate and 
then put onto the beads, and then the beads are randomly distributed on a patterned surface. 

However the random p.acement of the beads means that all or part of tine array must be "decoded" 
after synthesis; that is, after the array is made, a correlation of the location of an individual site on the 
array with the bead or candidate agent at that particular site can be made. This encoding/decoding 
can be done in a number of ways, as is generally described in 60/090.473; 09/189.543; 08/944.850; 
08/818 199; 09/151,877; and 08/851.203. al. of which are hereby express* incorporated by reference 
in their entirety. These methods include: (1) "encoding" the beads with unique optical signatures 
general* fluorescent dyes, that can be used to identify the chemical functionality on any particular 
bead- (2) using a decoding binding ligand (DBL). generally direct* labe.ed. that binds to e,ther the 
bioactive agent or to identifier binding ligands (IBLs) attached to the beads; (3) the use of postal 
decoding, for example by either targeting the placement of beads (for example by using 
photoactivatible or photocleavable moieties to allow the selective addition of beads to part.cu.ar 
locations), or by using either sub-bundles or selective loading of the sites, as are more fully outimed 
below (4) the use of selective decoding, wherein only those beads that bind to a target are decoded; 
or (5) combinations of any of these. In some cases, as is more fully outlined below, this decoding may 
occur for all the beads, or only for those that bind a particular target analyte. Similarly, this may occur 
either prior to or after addition of a target analyte. 

This means that the beads may be random* distributed on the array. a fast and inexpensive process 
as compared to either the in situ synthesis or spotting techniques of the pnor art. 

Once the identity (i.e. the actual agent) and location of each microsphere in the array has been fixed 
the array is exposed to samples containing the target ana*tes. although as outlined below, ft. can be 
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done prior to. during or after the assay as well. The target analytes will bind to the bioactive agents as 
is more fully outlined below, and results in a change in an optical signal of a particular bead. 

The present invention is directed to compositions and methods that allow comparisons of sequential 
data images taken during decoding and assay analysis. That is, in the broadest sense, the invention 
5 provides computer systems comprising processors and computer readable memory that allow the 
storage and analysis of multiple captured images of the same array, whether to compare a decoding 
image and an experimental image, several experimental images or several decoding images. That is. 
a first data image is taken of a random array, and using either a fiducial template or an external 
fiducial, the data image is registered. A second data image is then taken and registered, and the two- 

1 o registered data images can now be compared, as is more fully outlined below. 

In a preferred embodiment, the present invention provides a variety of "registration" techniques that 
allow the comparison of a variety of these images in a uniform and reliable way. That is. in order to 
compare multiple data images from an array comprising a plurality of unique sites, it is important that 
the correct individual sites be compared during analysis. In a highly complex and small system, 
1 5 methods are needed to ensure that a first site in a first data image is correctly matched to the first site 
in a second data image. Accordingly, the present invention provides the incorporation of one or more 
reference features, also referred to herein as "markers" or "fiducials" or "registration points", that allow 
this registration from image to image. It is generally preferred to have a number of spatially separated 
fiducials so that small amounts of skew and reduction/enlargement can be determined and taken into 

2 0 account 

As is further described below, these fiducials can take a number of forms. For example, when the 
random array comprises beads, the fiducial may be a bead with a unique optical signature or other 
characteristic (Figure 1). When the random array comprises a fiber optic bundle, the fiducial may be a 
fiber element with a unique shape or optical properties. Alternatively, the substrate may have other 
25 types of physical fiducials. such one or more defined edges that have characteristic optical properties 
that can be either spaced along the edge(s) or comprise the entire edge (Figure 2). Alternatively, the 
fiducials may be an inherent characteristic of the array ; for example, small irregularities in the sites 
(features) of the array can be exploited to serve as fiducials, generating a "fiducial template". 

Accordingly, the present invention provides random array compositions comprising at least a first 
3 o substrate with a surface comprising individual sites. By "random" array herein is meant an array that 
is manufactured under conditions that results in the identification of the agent in at least some, if not 
all, of the sites of the array being initially unknown; that is, each agent is put down arbitrarily on a site 
of the array in a generally non-reproducible manner. What is important in random arrays, and what 
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« the t«chn*,ues 0, .he — can be — laid do»» —* - ■ — 

b,a«e agents ,i.e. different -*> » ~» * " ^,7J „ more, depending 
being P oss«. General,,. - «. I" L array. »us v«y h*h dens*, 
on me size of the beads and Ihe substrate, as «e« a, *. en p^y be made. Preferred 

«» den*. — »• * <—» * 1Z a ^ '0 000.000 ,0 about 

arrays range about 100,000 .0 about mo .xM^ooo « about 100.000 being 

parted prew. -7-^3 ZZ* m~* P— - — 
particularly preferred, and from about 20.000 to abo . ^ 

density arrays are fessthan ,.000. w«n from about 10 f^C» * ^ ^ 
,00 te about 500 be*g partly 

ma, not be in array format: that is. for some embedments. ^ „ 
different or identical compositions. Thus for example, large arrays may 



substrates. 



■1. . 1* that nartieulariv through the use of fiber optic 

!—* am,s " ri « — «« - " B p ° ssib,e k 

or less (with beads of 200 nm possible) can be usea, 

hpads in a 1 mm* 

fiber optic bundle, with densities of greater than 25.000.000 ino 
some instant as many as ,00 rntkn) per 0.5 cm- obtainable. 

rr zrrr^iarge. — * 
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glass and modified or functional^ giass, piastics (including actios,, P o.ystyrene and copoiymers of 
tyrene and other materia,, po.ypropy.ene. po.yethy.ene, P o.ybuty,ene. po,yurethanes. Te on * efc.), 
poiysaccharides. ny.on or nitroce.lu.ose, resins, si.ica or si.ica-based materia, inCuding s,con and 
modified sificon, carbon, meta,. inorganic glasses, p.astics, optica, fiber bund.es, and a vanety of 
other poiymers. in genera,, the substrates a.iow optica, detection and do not themse.ves apprecab.y 
fluoresce. 

Genera..y the substrate is fiat (planar). aUhough as wil. be appreciated by those in the art, other 
configurations of substrates may be used as we..; for exam P .e. three dimension* configurations can 
be used, for examp,e by embedding the beads in a porous b.ock of p.astic that^ows samp.e access 
.othebeadsandusingaconf^^ 

the inside surface of a tube, for flow-through sampie ana.ysis to minimi sampie vo.ume. Preferred 
substrates inc.ude optica, fiber bundles as discussed be.ow. and fiat p.anar substrates such as g.ass. 
polystyrene and other plastics and acrylics. _ 

,„ add«, as is more fully o„«ined be,cw. me substrate may indude a coa«ng. edging o, shear, of 
materia,, genera,* de»c«e. ma. defies a «. edge that .ay serve as on, or mere «dt«*, 

,„ a preferred embodiment, »e substrate is an optical « her bund* or array, as Is genera,, descnbed 

express., incorporated nerein try reference. Preferred embodiments utilize preformed un lB ry f*e 

gerjiy Individ-, cad. However, one thing mat distingue a preformed un«y array torn mer 

.eatin me preformed un«a„ arra, with agents that separate mem. for exempt trea W g a Permed 

LI can be separated. However, absent mese intend treatments, erte strand genera,, cannot 
be physical, separated at an, point along its length from another fiber strand. 

At leas, one surface 0, the substrate is modiTred to contain discrete, individual *m*m* 
assoda.onofm.rosp^, These sites me, also be referred to in ™ embody* 
These s«es ma, comcrise ph,s*a.y a»ered s«es. i.e. physic*, configurates such as we ft , • «- 
, depressions in me substrate mat can re* me beads. s,ch met a mfcros^re can -est . me we., or 
U,e use of .me, forces (magnet* or compress**), cr chemfcalr, a»ered o, « ^ «»* as 
chemical,, funcSonaSzed sites. etectrostaKari, altered snes. hydrophobicaihyf hydropr,*ca«, 
functionalized sites, spots of adhesive, etc. 
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The sites may be a pattern, i.e. a regular design or configuration, or randomly distributed. A preferred 
embodiment utilizes a regular pattern of sites such that the sites may be addressed in the X-Y 
coordinate plane. "Pattern" in this sense includes a repeating unit cell, preferably one that allows a 
high density of beads on the substrate. However, it should be noted that these sites may not be 
discrete sites. That is, it is possible to use a uniform surface of adhesive or chemical functionalities, 
for example, that allows the attachment of beads at any position. That is, the surface of the substrate 
is modified to allow attachment of the microspheres at individual sites, whether or not those sites are 
contiguous or noncontiguous with other sites. Thus, the surface of the substrate may be modified 
such that discrete sites are formed that can only have a single associated bead, or alternatively, the 
surface of the substrate is modified and beads may go down anywhere, but they end up at discrete 
sites. 

In a preferred embodiment, the surface of the substrate is modified to contain wells, i.e. depressions in 
the surface of the substrate. This may be done as is generally known in the art using a variety of 
techniques, including, but not limited to. photolithography, stamping techniques, molding techniques 
and microetching techniques. As will be appreciated by those in the art. the technique used will 
depend on the composition and shape of the substrate. 

In a preferred embodiment, physical alterations are made in a surface of the substrate to produce the 
sites. In a preferred embodiment, the substrate is a fiber optic bundle and the surface of the substrate 
is a terminal end of the fiber bundle, as is generally described in 08/818,199 and 09/151.877. both of 
which are hereby expressly incorporated by reference. In this embodiment, wells are made in a 
terminal or distal end of a fiber optic bundle comprising individual fibers. In this embodiment, the cores 
of the individual fibers are etched, with respect to the cladding, such that small wells or depressions 
are formed at one end of the fibers. The required depth of the wells will depend on the size of the 
beads to be added to the wells. 

Generally in this embodiment the microspheres are non-covalently associated in the wells, although 
the wells may additionally be chemically functionalized as is generally described below, cross-linking 
agents may be used, or a physical barrier may be used. i.e. a film or membrane over the beads. 

In a preferred embodiment, the surface of the substrate is modified to contain chemically modified 
sites, that can be used to attach, either covalently or non-covalently, Une microspheres of the invention 
to the discrete sites or locations on the substrate. "Chemically modified sites" in this context includes, 
but is not limited to. the addition of a pattern of chemical functional groups including amino groups, 
carboxy groups, oxo groups and thiol groups, that can be used to covalently attach microspheres, 
which generally also contain corresponding reactive functional groups; the addition of a pattern of 
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adhesive that can be used to bind the microspheres (either by prior chemical functiona.ization for the 
addition of the adhesive or direct addition of the adhesive); the addition of a pattern of charged groups 
(similar to the chemical functionalities) for the eiectrostatic attachment of the microspheres, ...when 
the microspheres comprise charged groups opposite to the sites; the addition of a pattern of chemical 
functions groups that renders the sites differentially hydrophobic or hydrophilic, such that the add.tion 
of simiiariy hydrophobic or hydrophilic microspheres under suitabie experiment* cond-tions w,li result 
in association of the microspheres to the sites on the basis of hydroaffinity. For example, theuse o 
Hydrophobic sites with hydrophobic beads, in an aqueous system, drives the association of the beads 
preferentially onto the sites. As outlined above, "pattern" in this sense includes the use of a un '|°"" 
reatmentofthesurface^^ 

surface resulting in discrete sites. As will be appreciated by those in the art, this may be accomplished 
in a variety of ways. 

,„ a preferred moment, « commons 0. the invention further comprise a pop.la.ion of 
rrfcrospheres. By -popu^r,- he* is mean, a pruralit, of beads as ou.lined above ,o, array, 
v«h,n the are separate »ubp^.a«ne. which oan be , s,n 8 » M» « n-J*. 

W en«ca. c*rospheres. That k. in some embCiments. as is more u, oouner, be*» th ,~m <m 
obtain only a sin* bead <« eaoh b*a*e egent; preferred embodiments tffc, a M °< -ds 
of each type. 

By -microspheres- or "beads" or "partic.es" or grammatical equivalents herein is meant smal. discrete 
particles. The composition of the beads will vary, depending on the class of bioactive agent and ft. 
method of synthesis. Suitable bead compositions include those used in peptide. nuc.e,c aod and 
organic moiety synthesis, including, but not limited to. plastics, ceramics, glass, polystyrene, 
methylsty^^ 

may all be used. "M/crospr.ere Defect G 0 ,de"from Bangs Laboratories, Rshers IN . a helpful 
guide. 

Thebeadsneednotbesphe^irregularpartidesmaybeused. '» addifon ^^ 
porous, thus increasing the surface area of the bead availabie for either bioactive agent attachme or 
tag attachment The bead sizes range from nanometers, i.e. 100 nm, to millimeters, i.e. 1 mm. w,th 
( beads from about 0.2 micron to about 200 microns being preferred, and from about 0.5 to about 5 
micron being particularly preferred, although in some embodiments larger or smaller beads may be 
used. 
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It should be noted that a key component of the invention is the use of a substrate/bead pairing that 
allows the association or attachment of the beads at discrete sites on the surface of the substrate, 
such that the beads do not move during the course of the assay. 

Each microsphere comprises a bioactive agent, although as will be appreciated by those in the art, 
there may be some microspheres which do not contain a bioactive agent, depending the on the 
synthetic methods. By "candidate bioactive agent" or "bioactive agent" or "chemical functionality" or 
•binding ligand" herein is meant as used herein describes any molecule, e.g., protein, oligopeptide, 
small organic molecule, coordination complex, polysaccharide, polynucleotide, etc. which can be 
attached to the microspheres of the invention. It should be understood that the compositions of the 
invention have two primary uses. In a preferred embodiment, as is more fully outlined below, the 
compositions are used to detect the presence of a particular target analyte; for example, the presence 
or absence of a particular nucleotide sequence or a particular protein, such as an enzyme, an 
antibody or an antigen. In an alternate preferred embodiment, the compositions are used to screen 
bioactive agents, i.e. drug candidates, for binding to a particular target analyte. 

Bioactive agents encompass numerous chemical classes, though typically they are organic molecules, 
preferably small organic compounds having a molecular weight of more than 100 and less than about 
2.500 daltons. Bioactive agents comprise functional groups necessary for structural interaction with 
proteins, particularly hydrogen bonding, and typically include at least an amine, carbonyl, hydroxyl or 
carboxyl group, preferably at least two of the functional chemical groups. The bioactive agents often 
comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic structures 
substituted with one or more of the above functional groups. Bioactive agents are also found among 
biomolecules including peptides, nucleic acids, saccharides, fatty acids, steroids, purines, pyridines, 
derivatives, structural analogs or combinations thereof. Particularly preferred are nucleic acids and 
proteins. 

Bioactive agents can be obtained from a wide variety of sources including libraries of synthetic or 
natural compounds. For example, numerous means are available for random and directed synthesis 
of a wide variety of organic compounds and biomolecules. including expression of randomized 
oligonucleotides. Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant 
and animal extracts are available or readily produced. Additionally, natural or synthetically produced 
libraries and compounds are readily modified through conventional chemical, physical and biochemical 
means. Known pharmacological agents may be subjected to directed or random chemical 
modifications, such as acylation, alkylation, esterification and/or amidification to produce structural 
analogs. 
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,„ a preferred embodiment, the bioactive agents are proteins. By "protein" herein is meant at ieast two 
covalently attached amino acids, which inciudes proteins, polypeptides, oligopeptides and pept.de, 
The protein may be made up of natural, occurring amino acids and pepUde bonds, or synthe c 
peptidomimetic structure, Thus "amino acid", or "peptide residue", as used herein means both 
naturaily occurring and synthetic amino acids. For example, homo-phenytalanine, citrulline and 
leucine are considered amino acids for the purposes of the invention. The side chains may 
either the (R) or the (S) configuration. In the preferred embodiment, the amino acids are ,n the (S) or 
L-configuration. If non-naturaiiy occurring side chains are used, non-amino acid subsUtuents may be 
used, for example to prevent or retard in vivo degradations. 

to one preferred embodiment, the bioactUve agents are naturally occurring proteins or fragments of 
naturaHy occuring protein, Thus, for examp.e. cel.u.ar extracts containing proteins, or random or 
directed digests of proteinaceous ce.lu.ar extracts, may be used. In this way libraries of procaryofcc 
and eu W tic proteins may be made for screening in the systems described herein. ParUcu.ay 
preferred in this embodiment are .ibraries of bacteria., funga.. vira.. and mammaHan prote.ns. w,th the 
latter being preferred, and human proteins being especially preferred. 

,n a preferred embodiment, the bioactive agents are peptides of from about 5 to about 30 amino 
acids with from about 5 to about 20 amino acids being preferred, and from about 7 to 
secularly preferred. The peptides may be digests of naturally occumng proteins as ,s out** 
bove. random peptide, or "bfcsed" random peptide, By "random^ or grammatca, events 
herein is meant that each nuCeic acid and peptide consists of - entia1 *" 
amino acid, respect, Since general, these random peptides (or nucleic ac , d,sc ssed betow) 
are chemica., synthesized, they may incorporate any nudeotide or amino acd ^^J* 
synthetic process can be designed to generate randomized proteins or - cteiCaC ' dS ' t ;; ^ qa 
formation of a., or most of the possible combinations over the length of the sequence, thus formmg a 
library of randomized bioactive proteinaceous agents. 

,„ , Mem* .Knbodune* . H»y <* agent, a,e used. The tibrar, sta*. provide . 

J, one o, «s member- I- a Mr*. M +m . M> *» •« ^ T*' " 

dlffolt to gauge the reauircd astute *. of an interaction «t»». «•«« P«>*" s 3 « «* »" 

sufMentatfcitytointe^ctwithmostpottntlataotigensfaoedbyano.jan^ Pushed ,n 

sunicici y cnfficient to find structures with 

selection techniques have also shown that a library s,ze of 10 to 10 ,s suffiaent to t,no 

affinity for the target Thus, in a preferred embodiment, at least 10*. preferably at least 10 , more 



13 



PCT/US00/03375 

WO 00/47996 

preterabr, a. fees, ,0- and mos, pre-era.* a, leas, ,»• different bloa*e agents are simukaneoosiy 
araly zed in the subfec. methods. Preferred methods maximize libra, size and d«rs«y. 

,„ a pretend MM « « » "» • * » "<"~ "~ " ZT 

a J, pos«io. In a purred embodiment the libra,, is H That is. some pos«o»s -,,n the 

„ a prefer embodiment M nudeotides o, an.no acid residues an, rendered -» a def,n* 

,eLs. threonines, tyrosine, or histidines .0, phosphorylation sites. etc.. or ,» punnes. etc. 

,„ . preferred embodiment, me bioactive agents are nucleic ac«s (general* called -probe nud* 

herei „ meansa.teas.t.onuCeotdesoova^r.Ked aether. ^ -- 7"T 
hrt. wii, gene.Hr oontain phosphodfcste, bonds, e«bougb in some ca*s. as =*ned mm. 
add anatogs an, indeed tha, ma, ha.e a«emate backbones, comprising, fr exampfe 

LL»m. 35:3800 (1970,; «*« e, a,.. iaJJ^ <^">: 
AddsP.es 14-3487 (1986); Sawai. ef «* W*Q«. era,.. UuStOL 
£fj£™ (198 >; and Pauw* eta, SSemZm. »«1 ("M. P-osr—a Mag. 
« N^gJ^ 1*1437 ,1991,; a* U.S. Patent No. 5.844.048, pncapbo^ (Bn, 
ef el' J Am Chem. Soc. 111;2321 (1989)), o-methylphophoroamidite linkages (see Eckstem, 
oitc^a^atogues; APraCca, Ac^. Oxtord U^fry Press,and pep.de nudac 
2 backbones and iinkagea (see Eghdm J^m^ <1« « ^ 

.ajdEngi 31:1008 (1992); Nielsen. Naffi. 385:568 (1993); Caisson. e, al. (Mm 80.207 

posje backbones (Oenpc. ete,., »rWWIW «*" (1995), non^ntc™ 
U.S. PaU0.N0, 5.386.023; 5,637.684; 5.602.240; 5.2,6,41; and 4.469.863; Kredro** a. 

' r-^ ,„ FH Enolish. 30:423 ,1991); Ledger, e, a,.. J^eoLStc. 1104470 (.988). 
X.e,a,Nuc^l . Nu*o*es. ,3:1697 (1994); Chapters 2 and 3. ASC Symposium 
Senes 580. "Carbohydrate «»*— «— V * Sa " 9 '"" ? * , 

described m U.S. Patent Nos. 5,235,033 and 5.034,506. and Chapters 6 and 7. * S °^ P ^^ 
Senes 580 "Carbohydrate Modi«c*ns * An»sen» search". Ed. V.S. Sanghu, and P Dan Cook 

nude* adds (see Jenkins, et a, m^S^- («•» <• 16 '-" 6 >- ' ""^ ^ 
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analogs are described in Rawls, C & E News, June 2. 1997, page 35. All of these references are 
hereby expressly incorporated by reference. These modifications of the ribose-phosphate backbone 
may be done to facilitate the addition of additional moieties such as labels, or to increase the stability 
and half-life of such molecules in physiological environments; for example, PNA is particularly 
preferred. In addition, mixtures of naturally occurring nucleic acids and analogs can be made. 
Alternatively, mixtures of different nucleic acid analogs, and mixtures of naturally occurring nucleic 
acids and analogs may be made. The nucleic acids may be single stranded or double stranded, as 
specified, or contain portions of both double stranded or single stranded sequence. The nucleic acid 
may be DNA, both genomic and cDNA, RNA or a hybrid, where the nucleic acid contains any 
combination of deoxyribo- and ribo-nucleotides, and any combination of bases, including uracil, 
adenine, thymine, cytosine, guanine, inosine, xanthanine, hypoxanthanine, isocytosine, isoguanine, 
and base analogs such as nitropyrrole and nitroindole, etc. 

As described above generally for proteins, nucleic acid bioactive agents may_be naturally occuring 
nucleic acids, random nucleic acids, or "biased" random nucleic acids. For example, digests of 
procaryotic or eukaryotic genomes may be used as is outlined above: for proteins. 

In general, probes of the present invention are designed to be complementary to a target sequence 
(either the target analyte sequence of the sample or to other probe sequences, as is described 
herein), such that hybridization of the target and the probes of the present invention occurs. This 
complementarity need not be perfect; there may be any number of base pair mismatches that will 
interfere with hybridization between the target sequence and the single stranded nucleic acids of the 
present invention. However, if the number of mutations is so great that no hybridization can occur 
under even the least stringent of hybridization conditions, the sequence is not a complementary target 
sequence. Thus, by "substantially complementary" herein is meant Ihat the probes are sufficiently 
complementary to the target sequences to hybridize under the selected reaction conditions. High 
stringency conditions are known in the art; see for example Maniatis et al., Molecular Cloning: A 
Laboratory Manual, 2d Edition, 1989, and Short Protocols in Molecular Biology, ed. Ausubel. et al., 
both of which are hereby incorporated by reference. Stringent conditions are sequence-dependent 
and will be different in different circumstances. Longer sequences hybridize specifically at higher 
temperatures. An extensive guide to the hybridization of nucleic acids is found in Tijssen, Techniques 
in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes, "Overview of principles 
of hybridization and the strategy of nucleic acid assays" (1993). Generally, stringent conditions are 
selected to be about 5-10'C lower than the thermal melting point (T n ) for the specific sequence at a 
defined ionic strength pH. The T m is the temperature (under defined ionic strength, pH and nucleic 
acid concentration) at which 50% of the probes complementary to the target hybridize to the target 
sequence at equilibrium (as the target sequences are present in excess, at T m , 50% of the probes are 
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occupied at equilibrium). Stringent conditions will be those in which the salt concentration is less than 
about 1 0 M sodium ion. typically about 0.01 to 1.0 M sodium ion concentration (or other salts) at P H 
7 o to 8 3 and the temperature is at .east about 30'C for short probes (e.g. 10 to 50 nuclides) and at 
.east about 60'C for long probes (e.g. greater than 50 nucleotides). Stringent conditions may also be 
achieved with the addition of destabilizing agents such as formamide. In another embodiment, less 
stringent hybridization conditions are used; for exampte. moderate or low stringency conditions may be 
used, as are known in the art; see Maniatis and Ausubel, supra, and Tijssen. supra. 

The term 'target sequence" or grammatical equivalents herein means a nucleic acid sequence on a 
sing.e strand of nucleic acid. The target sequence may be a portion of a gene, a regulatory sequence, 
genomic DNA, cDNA, RNA including mRNA and rRNA. or others. It may be any length. w.th the 
understanding that tonger sequences are more specific. As wi.l be appreciated by those in the art. the 
complementary target sequence may take many forms. For examp.e. it may be contained within a 
.arger nucleic add sequence, i.e. all or part of a gene or mRNA. a restriction fragment of a plasmid or 
genomic DNA, among others. As is outlined more ful.y be.ow. probes are made to hybridize to target 
sequences to determine the presence or absence of the target sequence in a sample. Generally 
speaking, this term will be understood by those skilled in the art. 

in a preferred embodiment, the bioactive agents are organic chemica. moieties, a wide variety of which 

are available in the literature. 

in a preferred embodiment, each bead comprises a single type of bioactive agent. a.though a pluraHty 
of individual bioactive agents are preferably attached to each bead. Similariy. preferred embodiments 
utifize more than one microsphere containing a unique bioactive agent; th* is. there is redundancy 
built into the system by the use of subpopulations of microspheres, each microsphere .n the 
subpopulation containing the same bioactive agent. 

As will be appreciated by those in the art. the bioactive agents may either be synthesfced directly on 
the beads, or they may be made and then attached after synthesis. In a preferred embod.ment, 
.inkers are used to attach the bioactive agents to the beads, to al.ow both good attachment, sufficient 
flexibility to allow good interaction with the target molecule, and to avoid undesirable binding reacfons. 



in a preferred embodiment, the bioactive agents are synthesized direct* on the beads. As is known ,„ 
the art many classes of chemical compounds are currently synthesized on so.id supports, such as 
peptides, organic moieties, and nuclei acids. It is a relatively straightforward matter to adjust the 
current synthetic techniques to use beads. 
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In a preferred embodiment, the bioactive agents are synthesized first, and then covalently attached to 
the beads. As will be appreciated by those in the art, this will be done depending on the composition 
of the bioactive agents and the beads. The functionalization of solid support surfaces such as certain 
polymers with chemically reactive groups such as thiols, amines, carboxyls, etc. is generally known in 
the art. Accordingly, "blank" microspheres may be used that have surface chemistries that facilitate 
the attachment of the desired functionality by the user. Some examples of these surface chemistries 
for blank microspheres include, but are not limited to, amino groups including aliphatic and aromatic 
amines, carboxylic acids, aldehydes, amides, chloromethyl groups, hydrazide. hydroxy! groups, 
sulfonates and sulfates. 

These functional groups can be used to add any number of different candidate agents to the beads, 
generally using known chemistries. For example, candidate agents containing carbohydrates may be 
attached to an amino-functionalized support; the aldehyde of the carbohydrate is made using standard 
techniques, and then the aldehyde is reacted with an amino group on the surface. In an alternative 
embodiment, a sulfhydryl linker may be used. There are a number of sulfhydryl reactive linkers known 
in the art such as SPDP, maleimides. a-haloacetyls, and pyridyl disulfides (see for example the 1994 
Pierce Chemical Company catalog, technical section on cross-linkers, pages 155-200, incorporated 
herein by reference) which can be used to attach cysteine containing proteinaceous agents to the 
support. Alternatively, an amino group on the candidate agent may be used for attachment to an 
amino group on the surface. For example, a large number of stable Afunctional groups are well 
known in the art. including homobifunctional and heterobifunctional linkers (see Pierce Catalog and 
Handbook, pages 155-200). In an additional embodiment, carboxyl groups (either from the surface or 
from the candidate agent) may be derivatized using well known linkers (see the Pierce catalog). For 
example, carbodiimides activate carboxyl groups for attack by good nucleophiles such as amines (see 
Torchilin et al.. r.m^ R*v Theraoe -^ nmr f r.*nto Systems. 7(4):275-308 (1991). expressly 
incorporated herein). Proteinaceous candidate agents may also be attached using other techniques 
known in the art. for example for the attachment of antibodies to powers; see Slinkin et al.. BjoconL 
ChenL2:342-348 (1991); Torchilin et al., supra; Trubetskoy et al.. Bioconj. Chem. 3 :323-327 (1992); 
King et al.. Cancer Res. 54 :6176-6185 (1994); and Wilbur et al.. Bio gonjuqate Chem. 5 :220-235 
(1994). all of which are hereby expressly incorporated by reference). It should be understood that the 
0 candidate agents may be attached in a variety of ways, including those listed above. What is 

important is that manner of attachment does not significantly alter the functionality of the candidate 
agent; that is. the candidate agent should be attached in such a flexible manner as to allow its 
interaction with a target 

Specific techniques for immobilizing enzymes on microspheres are known in the prior art. In one case. 
5 NH 2 surface chemistry microspheres are used. Surface activation is achieved with a 2.5% 
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glutaraldehyde in phosphate buffered saline (10 mM) providing a P H of 6.9. (138 mM Nad, 2.7 mM. 
KCI) This is stirred on a stir bed for approximately 2 hours at room temperature. The microspheres 
are then rinsed with u.trapure water plus 0.01 % tween 20 (surfaotant) -0.02%. and rinsed again wth a 
p H 7.7 PBS plus 0.01 % tween 20. Finally, the enzyme is added to the solution, preferabiy after bemg 
prefiltered using a 0.45um amicon micropure filter. 

,n some embodiments, the microspheres may additionally comprise identifier binding .igands for use in 
certain decoding systems. By "identifier binding .igands" or ,BLs" herein is meant a »* 
will specifically bind a corresponding decoder binding .igand (DBL) to faCitate the eluadauono the^ 
identity of the btoactive agent attached to the bead. That is. the IBL and the corresponding DBL form 

sufficient to differentiate between the corresponding DBL and other DBL. (that is. DBLs for <*her 
,BLS) or other components or contaminants of the system. The binding should be sufficient to remam 
bound under the condKions of the decoding step, induding wash steps to remove non-specific bmd.ng. 
,„ some embodiments, for example when the .BLs and corresponding DBLs are proteins or nuctae 
acids the dissociation constants of the IBL to its DBL will be less than about 10-10 M , with less 
than about 10- to 10- M" 1 being preferred and iess than about 10* -10* M< being particularly 
preferred. 

IBL-DBL binding pairs are known or can be readi.y found using known techniques. For example, when 
the IBL is a protein, the DBLs include proteins (particularly including anybodies or fragments thereof 
(FAbsetJorsmal. molecule, or vice versa (the IBL is an antibody and the DBL is a protein). Metal 
ion- metal ion .igands or che.ators pairs are a.so useful Antigen-antibody pairs, enzymes and 
substrates or inhibitors, other protein-protein interacting pairs, receptor-ligands. complementary 
nucleic add, and carbohydrates and their binding partners are also suitable binding pa.rs. Nuc^ic 
acid - nudeic acid binding proteins pairs are also useful. Similarly, as is genera..y descnbed , U.S. 
Patents 5.270.163. 5.475,096. 5.567.588. 5.595.877. 5.637.459. 5.683.867.5.705.337 an re^te 
patents, hereby incorporated by reference, nudeic acid 'aptomers" can be ~ 

wide body of literature relating to the development of binding pairs based on combinatona. chem,stry 
methods. 

,n a preferred embodiment, the IBL is a molecule whose co.or or luminescence propels change in 
the presence of a sele^ely-binding DBL. For example, the IBL may be a fluorescent pH .ndicator 
whose emission intensity changes with pH. Simi.ar.y, the IBL may be a fluorescent ion md.cator. 
whose emission properties change with ion concentration. 
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Alternatively, the IBL is a mo.ecu,e whose co.or or luminescence properties change in the presence of 
v oussolt, For example, the IBL may be a fluorescent molecule such as an ethidium salt whose 
fluorescence intensity increases in hydrophobic environments. Similarly, the IBL may be a denvative 
of fluorescein whose color changes between aqueous and nonpolar solvents. 

In one embodiment, the DBL may be attached to a bead. i.e. a "decoder bead", that may car^ a label 
such as a fluorophore. 

,„ a preferred embodiment, the IBL-DBL pair comprise substantially complementary single-stranded 
nucleicacid, In this embodiment, the binding ligands can be referred to as -identifier probes and 
•decoder probes". Generally, the identfier and decoder probes range from about 4 basepa.rs ,n length 
to about 1000, with from about 6 to about 100 being preferred, and from about 8 to about 40 be.ng 
partial preferred. What is important is that the probes are tong enough to be specrfic ..e. to 
distinguish between different IBL-DBL pairs, yet short enough to allow both a) dissociate, ,f 
necessary, under suitable experimental conditions, and b) efficient hybridation. 

.napreferred embodiment, as is more fu..y outlined below, the IBLs do not bind to ^.Rather, the 
,BLs are used as identifier moieties ('.Ms") that are identified direct*, for example through the use of 
mass spectroscopy. 

,„ , prefer, embodiment, the spheres comprise an qM stature that can be user .to 

coming that bioactive agent- be Men** onthe bass ofthe signal. M 

anodK, M o. mmm* is used. for example «nen beads o, different s*es are used, orwhen 
array Is loaded sequentially with different batches ot beads. 

,„ a purred embodiment, me optol sfcnature is general* a *« of reporter dyes, preferatfy 
ftaomscent By va^ing both to composition of the mixture (i.e. the ratio of one dye to another) and 

alternate, by entrapping the dye «» the bead. The dyes may be chromophrxes 0, phosphors 
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but are preferably fluorescent dyes, which due to their strong signals provide a good signal-to-noise 
ratio for decoding. Suitable dyes for use in the invention include, but are not limited to, fluorescent 
lanthanide complexes, including those of Europium and Terbium, fluorescein, rhodamine, 
tetramethylrhodamine, eosin, erythrosin, coumarin, methyl-coumarins. pyrene, Malacite green, 
stilbene. Lucifer Yellow, Cascade Blue™, Texas Red. and others described in the 1989-1991 
Molecular Probes Handbook by Richard P. Haugland. hereby expressly incorporated by reference. 

In a preferred embodiment, the encoding can be accomplished in a ratio of at least two dyes, although 
more encoding dimensions may be added in the size of the beads, for example. In addition, the labels 
are distinguishable from one another; thus two different labels may comprise different molecules (i.e. 
two different fluors) or. alternatively, one label at two different concentrations or intensity. 

In a preferred embodiment, the dyes are covalently attached to the surface of the beads. This may be 
done as is generally outlined for the attachment of the bioactive agents, using functional groups on the 
surface of the beads. As will be appreciated by those in the art, these attachments are done to 
minimize the effect on the dye. 

In a preferred embodiment, the dyes are non-covalently associated with the beads, generally by 
entrapping the dyes in the bead matrix or pores of the beads. Fluorescent dyes are generally 
preferred because the strength of the fluorescent signal provides a better signal-to-noise ratio when 
decoding. Additionally, encoding in the ratios of the two or more dyes, rather than single dye 
concentrations, is preferred since it provides insensitivity to the intensity of light used to interrogate the 
reporter dye's signature and detector sensitivity. 

In one embodiment, the dyes are added to the bioactive agent, rather than the beads, although this is 
generally not preferred. 

In one embodiment, the microspheres do not contain an optical signature. 

In a preferred embodiment, the present invention does not rely solely on the use of optical properties 
to decode the arrays. However, as will be appreciated by those in the art, it is possible in some 
embodiments to utilize optical signatures as an additional coding method, in conjunction with the 
present system. Thus, for example, as is more fully outlined below, the size of the array may be 
effectively increased while using a single set of decoding moieties in several ways, one of which is the 
use of optical signatures one some beads. Thus, for example, using one "set" of decoding molecules, 
the use of two populations of beads, one with an optical signature and one without, allows the effective 
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doubling of the array size. The use of multiple optical signatures similarly increases the possible size 



of the array. 



In a preferred embodiment, each subpopulation of beads comprises a plurality of different identifier 
binding ligands ("IBLs"). By using a plurality of different IBLs to encode each bioactive agent, the 
number of possible unique codes is substantially increased. That is, by using one unique IBL per 
bioactive agent, the size of the array will be the number of unique IBLs (assuming no "reuse" occurs, 
as outlined below). However, by using a plurality of different IBLs per bead, n, the size of the array 
can be increased to 2", when the presence or absence of each IBL is used as the indicator. For 
example, the assignment of 10 IBLs per bead generates a 10 bit binary code, where each bit can be 
designated as T (IBL is present) or "0» (IBL is absent). A 10 bit binary code has 2 1 ° possible variants 
However, as is more fully discussed below, the sfce of the array may be further increased if another 
parameter is included such as concentration or intensity; thus for example, if two different 
concentrations of the IBL are used, then the array size increases as 3". Thus, in this embodiment, 
each individual bioactive agent in the array is assigned a combination of IBLs. which can be added to 
the beads prior to the addition of the bioactive agent, after, or during the synthesis of the bioactive 
agent, i.e. simultaneous addition of IBLs and bioactive agent components. 

Alternatively, when the bioactive agent is a polymer of different residues. i.e. when the bioactive agent 
is a protein or nucleic acid, the combination of different IBLs can be used to elucidate the sequence of 
the protein or nucleic acid. 

Thus, for example, using two different IBLs (IBL1 and IBL2). the first position of a nucleic acid can be 
elucidated: for example, adenosine can be represented by the presence of both IBL1 and IBL2; 
thymidine can be represented by the presence of IBL1 but not IBL2, cytosine can be represented by 
the presence of IBL2 but not IBL1 . and guanosine can be represented by the absence of both. The 
second position of the nucleic acid can be done in a similar manner using IBL3 and IBL4; thus, the 
presence of IBL1. IBL2. IBL3 and IBL4 gives a sequence of AA; IBL1. IBL2, and IBL3 shows the 
sequence AT; IBM. IBL3 and IBL4 gives the sequence TA. etc. The third position utilizes IBLS and 
IBL6. etc. In this way. the use of 20 different identifiers can yield a unique code for every possible 10- 



mer. 



The system is similar for proteins but requires a larger number of different IBLs to identify each 
position, depending on the allowed diversity at each position. Thus for example, if every amino acid is 
allowed at every position, five different IBLs are required for each position. However, as outlined 
above, for example when using random peptides as the bioactive agents, there may be bias built into 
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the system; not all amino acids may be present at all positions, and some position may be preset; 
accordingly, it may be possible to utilize four different IBLs for each amino acid. 

In this way, a sort of "bar code" for each sequence can be constructed; the presence or absence of 
each distinct IBL will allow the identification of each bioactive agent. 

5 In addition, the use of different concentrations or densities of IBLs allows a "reuse" of sorts. If, for 
example, the bead comprising a first agent has a 1X concentration of IBL, and a second bead 
comprising a second agent has a 10X concentration of IBL, using saturating concentrations of the 
corresponding labelled DBL allows the user to distinguish between the two beads. 

In a preferred embodiment, the compositions of the invention further comprise at least one fiducial. By 
10 "fiducial" or "marker" or "registration point" herein is meant a physical reference feature or 

characteristic that allows precise comparisons of sequential data images of an array. The use of 
fiducials is useful for a variety of reasons. In general, the assays involve monitoring of objects, i.e. 
bioactive agents, located at spatially distinct locations (features) over the course of several data image 
frames taken over time. Any shifting that occurs from frame to frame complicates the analysis of the 
15 agents. By incorporating permanent fiducials into the assay structure, each data image can be 

aligned, either manually or automatically, to allow accurate comparison of the images, and control for 
translation (i.e. a shift in an X-Y direction) and/or rotation as well as reduction or enlargement of the 
image. In addition, when fluorescence based assays are used (either for decoding or analyte 
assaying or both), in any given image, a particular region or feature may or may not emit fluorescence, 
2 0 depending on the label characteristics and the wavelength being interrogated, or the presence or 

absence of an analyte or DBL, etc. The presence of fluorescence is detected as a positive change in 
feature intensity with respect to the background intensity, which is then used to draw a software 
"segment" over the core. In situations where the core is dark, i.e. no fluorescence is detected at that 
particular feature, it is difficult to accurately draw the segment over the core. 

25 Accordingly, in a preferred embodiment, at least one fiducial is incorporated into the array. In a 

preferred embodiment, a plurality of fiducials are used, with the ideal number depending on the size of 
the array (i.e. features per fiducial), the density of the array, the shape of the array, the irregularity of 
the array, etc. In general, at least three non-linear fiducials are used; that is, three fiducials that define 
a plane (i.e. are not in a line) are used. In addition, it is preferred to have at least one of the fiducials 

30 be either on or close to the periphery of the array. 

In a preferred embodiment, the substrate is a fiber optic bundle and the fiducial is a fiducial fiber. As 
will be appreciated by those in the art, the characteristics of a fiducial fiber may vary widely. For 
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example, in a preferred embodiment, the flducia, flbers ma, have unk,ue or spec* opto! proves; 
11.1 spec*™ gfcsses are avaiiable ,n a wide range o, composes, an. otten possess 

rray In add-on. as is true tor a. me fttM technics herein, I is o«en preferred .0 have d,,fe,e . 

flducials coux. be 0. a dWeren. shape „ s*e i. .he other three or o.s«nal* oft* 

The flducia, .bers ma, each he tabeled with the same or with ditferen. labels, in a prefer** 
Lodimen, a flber k cated with a single labei. in addition, multipfe flbers to be incorporate * an 

i* — - * mana.emativeembod^eacho.apiu^ot^, 
rsslbeled^adinerentordisceteiabei. in a preferred embodiment^.*, compos 

quantum dots. 

' Arrays comprising fiducial libers are generally made in a vanety ot ways. 

such as cuantum do* a. the met.no stage, prior to the maohining o, the glass » rods tor draw,ng. 

auchasafluorescen, dopant materia,. Prefened embodin»n B u« M ,noman« na„op3«es 
ILme* ph=tos*bfe over long period o. time (i.e. resist to photob,eact»g Th 

perferred embodiment a po**perse c**n o. particles is employed to grve nse to broad 

absorbtive and emissive properties. 

flducia,. and a labe, such as a fluorescent pa«e is added to me exfeno, »'™ n f 
nse to a flber mat exhibits fluorescent properties ot bom me internal end extern,, dopants. 

«dt*» «gh, can be made ,0 propagate down me core. ex*g any fluorescent matena, c<esen, 
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^ intprface between the core and the clad. The 
fiber where it is detected, for example by a CCD camera. 

number ana complexity of labels and/or registers increases. 

====== ====■ 

techniques) large fiducial beads may be la.d down m certa,n s,tes. As abo 
"label" for the array. 



array. 



marker bead is particularly preferred. 

following the addition of other microspheres, 
.napreferredembo^^^ 

appreciated by those in the art. this may be done -n a vanety of ways In o 

o sheath of fiducial material, such as highly fluorescent glass, is .nccrporated ,nto the ar^ 

:=-:===£======- 
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identifying features along its length. Alternatively, the fiducial materia, is placed in discrete spots or 
discrete shapes; basically, any orientation that allows translation, rotation, enlargement or reduction of 
images to be detected can be used. 

in a preferred embodiment, an exogeneous fiducial is not used; rather, .nherent characteristics of the 
array are used. That is. rather than incorporate a special feature into the array to serve as a fiducial, 
the inherent variability of the features of an array is used to create a sort of "fiducial template". In this 
embodiment, an image of the array is taken under conditions in which all the features are illuminated 
evenly and can be differentiated from one another. For example, the surface of the substrate can be 
illuminated with white light in such a way that a., the features are illuminated evenly. Th,s finds 
, particular use when the substrate is a fiber optic bundle with etched wells, in that the nomination angle 
and intensity is chosen such that the light reflecting off the beads differs in intensity from the l-ght 
reflecting off the cladding and spacer material. Preferred embodiments utilfce polarized light or hght 
impinging at various angles. Alternately, the surface of the array may be contacted w,th a 
fluorescent solution, allowing fluorescence to be collected equally by all the features. 

5 , n a preferred embodiment, the invention provides the use of an image produced by a randomly 

ordered array to identify and/or label the array. When forming a random array, many, but not all. of the 
microwells on an array are filled with microsphere, The filled versus unfilled sites on the array are 
randoms; thus, an image or a comport, of images of an array that details the filled from unfilled 
.ocations on the array serves as a unique identfler of the array. Thus, the image of a particular array 

0 is statistically different and distinct from an image or a composite of images of another array even 
though the different arrays have functional equivalence. By "statistically different" is meant that 
although there is a theoretical probabilrty that two arrays may be similar, the probability is so small as 
to be insignificant or unimportant. 

in one embodiment, the arrays have at least one subpopulation of microspheres. The pattern on the 
25 array created by the random assembly of microspheres on the array serves to identify the P art,cu ar 
array. The image of the array registers the location of each bead such that composite images taken 
from the array can be compared directJy. For example, an image produced by an array after exposure 
to a first substance can be directly compared with the same array exposed to a second substance. 
Atternatively. a single population of microspheres can be analyzed by multiple wavelengths and 
3 o directly compared. 

,n another embodiment, these arrays have two or more subpopulation* represented in each array. 
Because the arrays are assembled randomly, the individual locations of beads representing each 
subpopulation are randomized. Thus, an image or a composite of images that registers the .ocation of 
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each bead in a particular subpopulation will be statistically different from an image or composite of 
images of another array even though the different arrays have functional.equivalence. Likewise, the 
image or composite of images for another subpopulation within the same array will be statistically 
different from an image or composite of images of another array. 

In addition, the number of beads of each subpopulation that actually populate a given array can vary. 
The specific number of beads within a subpopulation on an array is defined by a Poisson distribution. 
The variation in number of beads representing a subpopulation adds another dimension to identifying 
individual arrays. 

The recognition that functionally equivalent arrays result in different images affords one the possibility 
of using that difference to "fingerprint" each array. Essentially, each random array has a built >n 
method for identifying and tracking that array. 

Thus the invention facilitates the use of innate features within a randonTarray to identify and track a 
specific array. The ability to identify and track a specific array has important functionality ain quality 
control monitoring, inventory monitoring, performance monitoring and use monitoring. For example, 
the ability to identify an array will allow one to determine when it is used and whether it is reused. 

in a preferred embodiment, the template image is used to define a -grid" which is placed upon the data 
images The use of a template image to define the location of features is optional, although currently 
preferred. Using standard image processing software such as Image Pro (Media Cybernetics) a 
template is built based on this grid. This type of software allows the user to create simultaneous 
software segments to calculate the mean feature intensity over a region of interest using a simple, one 
step segmentation function. This software-based fiducial template can then be mapped onto each data 
image in the assay protocol to allow data collection for each region for each data image. See for 
example U.S. Patent No. 5.768.412. This allows the location of each array feature to be defined. 

Once the microspheres comprising the candidate agents and the unique tags are generated, they are 
added to the substrate to form an array. In general, the methods of making the arrays and of decoding 
the arrays is done to maximize the number of different candidate agents that can be uniquely 
encoded. The compositions of the invention may be made in a variety of ways. In general, the arrays 
are made by adding a solution or slurry comprising the beads to a surface containing the sites for 
attachment of the beads. This may be done in a variety of buffers, including aqueous and orgamc 
solvents, and mixtures. The solvent can evaporate, and excess beads removed. 
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» shou.d be noted that not a,, sites of an array may comprise a bead; that is, there may be some sites 
on the substrate surface which are empty. In addition, there may be some sites that contarn more 
than one bead, although this is not preferred. 

,t should additionally be noted that in some cases, empty sites can serve as fiducials. That is. 
consistently "dark" sites can a,so be used as fiducial, This finds particular use when the filhng 
efficiencies of the array are high; that is, when most sites contain a bead. In addition the da* s,tes 
a,so can be used to -fingerprint" the array as described above. That is, the image of Irght and dark 
sites serves to define or identify a partial array. This image also serves to register the array for 
comparison purposes. 

,n some embodiments, for example when chemical attachment is done, it is possible to attach the 
beads in a non-random or ordered way. For example, using photoactivatible attachment Irnkers or 
photoactivatible adhesives or masks, se.ected sites on the array may be sequentially rendered 
suitable for attachment, such that defined populations of beads are laid down. 

The arrays of the present invention are constructed such that information about the idenfity of the 
candidate agent is built into the array, such that the random deposition of the beads in the fit*r we..s 
can be "decoded" to a.low identification of the candidate agent at all positions. This may be done ,„ a 
variety of ways, and either before, during or after the use of the array to detect target molecules. 

Thus after the array is made, it is "decoded" in order to identify the .ocation of one or more of the 
bioactive agents, i.e. each subpopulation of beads, on the substrate surface. In genera . bom 

of which are described be.ow. requires the comparison of sequential data images to deterge the 
deferences between two data images. to^m^W*™"™"™ 

andthenthetworegisteredimagescanbecompared. ,n *is context, a "data image rncudesa 
primary data image or a reduction of the image; for exampie. the image may be reduced to a set X- 
Y coordinates with corresponding intensity values. 

in a preferred embodiment, this is done using a computer system comprising a processor and a 
computer readable memory. The computer readable memory comprises an acquis mod* that 
comprises computer code that can recede a data image from a random array and a reg.tra on 
module comprising computer code that can register the data image using at least one fiduc* 
including a fiducia. tempiate. to generate a registered data image. This registered data ,mage can 
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then be stored in a storage module as needed. This same computer code, or different code, if 
required can be used to receive additional data images and generate additional registered data 
images which also can be stored. The computer readable memory further comprises a companson 
module comprising computer code that can compare the registered data images to determine the 
differences between them, to a.low both decoding of the array and target analyte detect.cn. Th t .s 
wh en decoding is done.the comparison of at least two registered data imagesallows the ident.cat.on 
of the location of at least two unique bioactive agents on the array. 

Thus, using the systems described herein, a random array is decoded, .n a preferred embodiment a 
selecivedecoding system is used. In this case, on,y those microspheres exhibiting a change , the 
oZ signal asaresu, of the binding of a target ana.yte are decoded. This is common, done when 
the number of -hits", i.e. the number of sites to decode, is generally tow. That is. the array ,s first 
led under experiment, conditions in the absence of the target ana.ytes. The sample conta.n.ng 
the target analytes is added, and only those locations exhibiting a change in the optica. signal are 
decoded. For example, the beads at either the positive or negative signa. locations may be e her 
selectively tagged or released from the array(for example through the use of photocleavable ,,Rers 
^^^^ 

or alternative, all the positives are released and analyzed. Alternatively, the labels may compnse 
nalogenated aromatic compounds, and detection of the label is done using for example gas 
chromatography, chemical tags, isotopic tags mass spectral tags. 

,n a preferred embodiment, atomic force microscopy (AFM) is used to decode the array In this 
mbo„^ 

IBL Theforceof.nteracUonbetweenthe.BL/DBLismeasuredusingAFM. IN add-on, s,ce AFM 
has atomic resolution, a variety of other physical characteristics, including physical s ze and s ape can 
be used for decoding. For exampie, different "shaped" molecules could be used as .8 s; ,n th s 
embodiment, the AFM tip can comprise a DBL or just a moiety that can detect different surfaces, 
addition. AFM could be used as "nanotweezers" to deliver or recover beads to and from specf, 
locations on the array. 

As - be abated by .hose in the art m* ma, also he done In systems where the array * no, 
aecoded-. I e. mere need no. ever he a correlation ot head composition with loeafion. In th,s 
emood^* me heads are B aded on »e array, and the assay is™. The 
« rt s drying a chance in the cptel signal as » more *, ofed helow. are then ma** J. 

using fteropticarrays. tn a preferred emoodimen,. each head contains a tiuorescen. dye. Arce-ma 
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assay and the identification of the "positives" or "active beads", light is shown down either only the 
positive fibers or only the negative fibers, generally in the presence of a light-activated reagent 
(typically dissolved oxygen). In the former case, all the active beads are photobleached. Thus, upon 
non-selective release of all the beads with subsequent sorting, for example using a fluorescence 
activated cell sorter (FACS) machine, the non-fluorescent active beads can be sorted from the 
fluorescent negative beads. Alternatively, when light is shown down the negative fibers, all the 
negatives are non-fluorescent and the the postives are fluorescent, and sorting can proceed. The 
characterization of the attached bioactive agent may be done directly, for example using mass 
spectroscopy. 

Alternatively, the identification may occur through the use of identifier moieties ("IMs"). which are 
similar to IBLs but need not necessarily bind to DBLs. That is, rather than elucidate the structure of 
the bioactive agent directly, the composition of the IMs may serve as the identifier. Thus, for example, 
a specific combination of IMs can serve to code the bead, and be used to identify the agent on the 
bead upon release from the bead followed by subsequent analysis, for example using a gas 
chromatograph or mass spectroscope. 

Alternatively, rather than having each bead contain a fluorescent dye, each bead comprises a non- 
fluorescent precursor to a fluorescent dye. For example, using photocleavable protecting groups, 
such as certain ortho-nitrobenzyl groups, on a fluorescent molecule, photoactivation of the 
fluorochrome can be done. After the assay, light is shown down again either the "positive" or the 
"negative" fibers, to distinquish these populations. The illuminated precursors are then chemically 
converted to a fluorescent dye. All the beads are then released from the array, with sorting, to form 
populations of fluorescent and non-fluorescent beads (either the positives and the negatives or vice 
versa). 

In an alternate preferred embodiment, the sites of attachment of the beads (for example the wells) 
include a photopolymerizable reagent, or the photopolymerizable agent is added to the assembled 
array. After the test assay is run, light is shown down again either the "positive" or the "negative- 
fibers, to distinquish these populations. As a result of the irradiation, either all the positives or all the 
negatives are polymerized and trapped or bound to the sites, while the other population of beads can 
be released from the array. 

In a preferred embodiment, the location of every bioactive agent is determined using decoder binding 
ligands (DBLs). As outlined above, DBLs are binding ligands that will either bind to identifier binding 
ligands. if present, or to the bioactive agents themselves, preferably when the bioactive agent is a 
nucleic acid or protein. 
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in a preferred embodiment, as outlined above, the DBL binds to the IBL. 

In a preferred embodiment, the bioactive agents are sing«e-stranded nucleic acids and the DBL is a 

e r mldecoder P robeherein. A decoder probe that is substantially complementary 
rdXo.eis .ade and used to de^ 

he decoder probes shou,d be of sufficient length (and the decoding step run under su.tab.e 
sufficient specificity to allow the distinction of each candidate probe. 

In a preferred embodiment, the DBLs are either ft** or indirect, labeled. By "labeled- ^ein is 
in a preferred tQ enable 

meant that a compound has at .east one element » pe ^ 
the detection of the compound. In general, labels fall into three classes, a, v 

========= 

H^Lw— — — 0---— ■ l "«se m «*.^„n flbl n angl « a nd-DBLp a ,r 
can Be as described above for IBL-OBL pairs. 

Aecordin* me id^tmcauon o, the tocaBon of fhe indi^a, beads (or stations of «ads> is 

« ' con,pr,,» 9 a^ 9Mh « CTm e 1 a0e,dDBtande*e, m e 

r:rrri:r:r^r,r:r n ds t oa,B La ..,o 

::™ 1 ,3dd i «o,aso U .,n.d.e rel , decode — - 

to in an assay, during the assay, or after the assay. 

encoded ,n ,r*er din*ns», ,.e. in M bead «. - fab* » — 

,s made vba. win spe*=a»v bind » * - «**» a BS ' *" , „ . 

fiuorochromes. Th u s, the iden,«y of each CBL. boih » oomposi»on ,i.e. • seouenc. »*«n «» 
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agents under conditions which allow the formation of complexes (termed hybridization complexes 
when the components are nucleic acids) between the DBLs and either the bioactive agents or the 
IBLs, the location of each DBL can be elucidated. This allows the identification of the location of each 
bioactive agent; the random array has been decoded. The DBLs can then be removed, if necessary, 
and the target sample applied. 

In a preferred embodiment, the number of unique labels is less than the number of unique bioactive 
agents, and thus a sequential series of decoding steps are used. To facilitate the discussion, this 
embodiment is explained for nucleic acids, although other types of bioactive agents and DBLs are 
useful as well. In this embodiment, decoder probes are divided into n sets for decoding. The number 
of sets corresponds to the number of unique tags. Each decoder probe is labeled in n separate 
reactions with n distinct tags. All the decoder probes share the same n tags. The decoder probes are 
pooled so that each pool contains only one of the n tag versions of each decoder, and no two decoder 
probes have the same sequence of tags across all the pools. The number ofpools required for this to 
be true is determined by the number of decoder probes and the n. Hybfidization of each pool to the 
array generates a signal at every address. The sequential hybridization of each pool in turn will 
generate a unique, sequence-specific code for each candidate probe. This identifies the candidate 
probe at each address in the array. For example, if four tags are used, then 4 X n sequential 
hybridizations can ideally distinguish 4" sequences, although in some cases more steps may be 
required. After the hybridization of each pool, the hybrids are denatured and the decoder probes 
removed, so that the probes are rendered single-stranded for the next hybridization (although it is also 
possible to hybridize limiting amounts of target so that the available probe is not saturated. Sequential 
hybridizations can be carried out and analyzed by subtracting pre-existing signal from the previous 
hybridization). 

An example is illustrative. Assuming an array of 16 probe nucleic acids (numbers 1-16). and four 
unique tags (four different fluors. for example; labels A-D). Decoder probes 1-16 are made that 
correspond to the probes on the beads. The first step is to label decoder probes 1-4 with tag A, 
decoder probes 5-8 with tag B, decoder probes 9-12 with tag C, and decoder probes 13-16 with tag D. 
The probes are mixed and the pool is contacted with the array containing the beads with the attached 
candidate probes. The location of each tag (and thus each decoder and candidate probe pair) is then 
determined. The first set of decoder probes are then removed. A second set is added, but this time, 
decoder probes 1. 5. 9 and 13 are labeled with tag A, decoder probes 2, 6, 10 and 14 are labeled with 
tag B, decoder probes 3, 7. 1 1 and 15 are labeled with tag C. and decoder probes 4, 8, 12 and 16 are 
labeled with tag D. Thus, those beads that contained tag A in both decoding steps contain candidate 
probe 1; tag A.ir, the first decoding step and tag B in the second decoding step contain candidate 
probe 2; tag A in the first decoding step and tag C in the second step contain candidate probe 3; etc. 



31 



WO 00/47996 PCT/USOO/03375 

l„ one embodiment, the decoder probes are labeled in situ; that is. they need not be labeled prior to 
the decoding reaction. In this embodiment, the incoming decoder probe is shorter than the cand.date 
probe creating a 5" "overhang" on the decoding probe. The addition of labeled ddNTPs (each labeled 
with a unique tag) and a polymerase will allow the addition of the tags in a sequence specific manner, 
thus creating a sequence-specific pattern of signals. Similarly, other modifications can be done, 
including ligation, etc. 

in addition, since the size of the array will be set by the number of unique decoding binding ligands. it 
is possible to "reuse" a set of unique DBLs to allow for a greater number of test sites. Th,s may be 
done in several ways; for example, by using some subpopulations that comprise optical signatures. 
Similarly the use of a positional coding scheme within an array; different sub-bundles may reuse the 
set of DBLs Similarly, one embodiment utilizes bead size as a coding modality, thus allowing the 
reuse of the set of unique DBLs for each bead sfce. Alternatively, sequential partial loading of arrays 
with beads can also allow the reuse of DBLs. Furthermore, "code sharing" can occur as well. 

In a preferred embodiment, the DBLs may be reused by having some subpopulations of beads 
comprise optica, signatures. In a preferred embodiment, the optica, signature is generally a mixture of 
reporter dyes, preferably fluorescent. By varying both the composition of the mixture (i.e. the ratio of 
one dye to another) and the concentration of the dye (leading to differences in signal intensrty). 
matrices of unique optica, signatures may be generated. This may be done by covalentiy attaching the 
dyes to the surface of the beads, or alternatively, by entrapping the dye within the bead. The dyes 
may be chromophores or phosphors but are preferably fluorescent dyes, which due to their strong 
signals provide a good signal-to-noise ratio for decoding. Suitable dyes for use in the invention 
include but are not limited to. fluorescent lanthanide complexes, inching those of Europium and 
Terbium fluorescein, rhodamine. tetramethylrhodamine. eosin. erythrosin, coumann. methyl- 
coumarins, pyrene. Malacite green, sti.bene. Lucifer Yellow, Cascade Blue-. Texas Red. and others 
described in the 6th Edition of the Molecular Probes Handbook by Richard P. Haugland. hereby 
expressly incorporated by reference. 

,n a preferred embodiment, the encoding can be accomplished in a ratio of at least two dyes, although 
more encoding dimensions may be added in the s*e of the beads, for example. In addition, the labels 
are distinguishable from one another; thus two different labels may comprise different molecules (..e. 
two different fluors) or. alternatively, one label at two different concentrations or intens.ty. 

,n a preferred embodiment, the dyes are covalently attached to the surface of the beads. This may be 
done as is general* outlined for the attachment of the bioactive agents, using functional groups on the 
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surface of the beads. As will be appreciated by those in the art. these attachments are done to 
minimize the effect on the dye. 

ln a preferred embodiment, the dyes are non-covalently associated with the beads, genera.iy by 
entrapping the dyes in the pores of the beads. 

Ad** encoding t fte ra«os o, ft. *> or more dyes, ratner man single d,e coneenWons is 

signature and detector sensitivity. 

.efcpbone system. 1. subarra, » - ~ "»< - "" — ^ ' '''tTfor 
ntlrs, 0 other subar^s. mat are separated by rtfe. o, ft. Mft. 0. me subar*,. Tbus. «* 

„ combined w«b too different subarrays oan torn, an array o, 5000 dfleren, b»a»e agent,. ,n 

subarray. 

ln a^e er.0dir.nu. additiona, ending parameters oan be ^^^^ 
For e-ampie. me us. o. different s*e beads may atso a»ow me reuse of sets of D6L , .bat ,s. 
nossibletousemierospf^resofdifferents^^ 

me same sfce oross-seotions can be used, but )us. wim different seed beads. With different diameters, 
dll^onuctectideseaue^c^^^ 

rio^csobst.fes.m.as-iasmeomermemods^ned bemoan be used ** omer 
substrates and with other attachment modalities as well. 

micrLpheresintothea,., As ou«ned above tor spate, oodmg. ,n thts — 
statures oan M -reused". ,n mis embodiment me fifcary 0, mrtrospheres eaob compnsrng 
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different bioactive agent (or the subpopulations each comprise a different bioactive agent). ,s divided 
into a plurality of sublibraries; for example, depending on the size of the desired array and the number 
of unique tags, 10 sublibraries each comprising roughly 10% of the total library may be made, w,th 
each sublibrary comprising roughly the same unique tags. Then, the first sublibrary is added to the 
fiber optic bundle comprising the wells, and the location of each bioactive agent is determmed, 
generally through the use of DBLs. The second sublibrary is then added, and the location of each 
bioactive agent is again determined. The signal in this case wi.l comprise the signal from the "first 
DBL and the "second" DBL; by comparing the two matrices the location of each bead m each 
sublibrary can be determined. Similarly, adding the third, fourth, etc. sublibraries sequentially will allow 
the array to be filled. 

,n a preferred embodiment, codes can be "shared" in several ways. In a first embodiment, a single 
code (i e IBL/DBL pair) can be assigned to two or more agents if the target analytes different 
sufficiently in their binding strengths. For example, two nucleic acid probes used in an mRNA 
quantitation assay can share the same code if the ranges of their hybridization signal intensities do not 
overlap This can occur, for example, when one of the target sequences is always present at a much 
higher concentration than the other. Alternative*, the two target sequences might always be present 
at a similar concentration, but differ in hybridization efficiency. 

Alternatively, a single code can be assigned to multiple agents if the agents are functionally equivalent. 
For example, if a set of oligonucleotide probes are designed with the common purpose of detecting the 
presence of a particular gene, then the probes are functionally equivalent, even though they may drffer 
in sequence. Similarly, if dasses of analytes are desired, all probes for different members of a class 
such as kinases or G-protein coup.ed receptors cou.d share a code. SirnHarly. an array of this type 
could be used to detect homologs of known genes. In this embodiment, each gene is represented by 
a heteroiogous set of probes, hybridizing to different regions of the gene (and therefore diffenng .n 
sequence). The set of probes share a common code. If a homolog is present, it might hybridize to 
some but not all of the probes. The level of homology might be indicated by the fraction of probes 
hybridizing, as well as the average hybridization intensity. Similariy, multiple antibodies to the same 
protein could all share the same code. 

Once made, the compositions of the invention find use in a number of applications. In a preferred 
embodiment, the composes are used to probe a sample solution for the presence or absence of a^ 
target ana^te. including the quantification of the amount of target analyte present. By "target analyte 
or "anaryte" or grammatical equivalents herein is meant any atom, molecule, ion. molecular ,on. 
compound or particle to be either detected or evaluated for binding partners. As wil. be appreciated by 
those in the art. a large number of analytes may be used in the present invention; basically, any target 
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analyte can be used which binds a bioactive agent or for which a binding partner (i.e. drug candidate) 
is sought 

Suitable analytes include organic and inorganic molecules, including biomolecu.es. When detection of 
a target analyte is done, suitable target analytes include, but are not limited to, an environmental 
pollutant (including pesticides, insecticides, toxins, etc.); a chemical (including solvents, polymers, 
organic materials, etc.); therapeutic molecules (including therapeutic and abused drugs. ant,b,ot,cs. 
etc )• biomolecules (including hormones, cytokines, proteins, nucleic acids, lipids, carbohydrates, 
cellular membrane antigens and receptors (neural, hormonal, nutrient, and cell surface receptors) or 
their ligands. etc); whole cells (including procaryotic (such as pathogenic bacteria) and eukaryot.c 
cells including mammalian tumor cells); viruses (including retroviruses, herpesviruses, adenoviruses, 
lentiviruses, etc.); and spores; etc. Particularly preferred anarytes are nucleic acids and prolans. 

in a preferred embodiment, the target analyte is a protein. As will be appreciated by those in the art. 
there are a large number of possible proteinaceoustarget analytes that may be detected or evaluated 
for binding partners using the present invention. Suitable protein target analytes include, but are not 
limited to, (1) immunoglobulins; (2) enzymes (and other proteins); (3) hormones and cytokines (many 
of which serve as ligands for cellular receptors); and (4) other proteins. 

in a preferred embodiment, the target analyte is a nucleic acid. These assays find use in a wide 
variety of applications. 

in a preferred embodiment, the probes are used in genetic diagnosis. For example, probes can be 
made using the techniques disclosed herein to detect target sequences such as the gene for 
nonpolyposis colon cancer, the BRCA1 breast cancer gene. P53. which is a gene associated w,th a 
variety of cancers, the Apo E4 gene that indicates a greater risk of Alzheimer disease, allowing for 
easy oresymptomatic screening of patients, mutations in the cystic fibrosis gene, cytochrome p450s or 
any of the others well known in the art. 

In an additional embodiment, viral and bacterial detection is done using the complexes of the 
invention. In this embodiment, probes are designed to detect target sequences from a variety of 
bacteria and viruses. For example, current blood-screening techniques rely on the detect™ of an* 
HIV antibodies. The methods disclosed herein allow for direct screening of clinical samples to detect 
HIV nucleic acid sequences, particularly highly conserved HIV sequences. In addition. th,s allows 
direct monitoring of circulating virus within a patient as an improved method of assessing the efficacy 
of anti-viral therapies. Similarly, viruses associated with leukemia, HTLV-I and HTLV-II, may be 
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detected in this way. Bacterial infections such as tuberculosis, chlamydia and other sexually 
transmitted diseases, may also be detected. 

,n a preferred embodiment, the nucleic acids of the invention find use as probes for toxic bacteria in 
the screening of water and food sample, For example, samples may be treated to lyse the bactena 
to release its nucleic acid, and then probes designed to recognize bacteria, strains, including, but not 
.imited to such pathogenic strains as. Sa/mor,e//a. Campylobacter, Vibrio cftofrae. Le*«nw». 
enterotoxic strains of £ co/,. and Legionnaire's disease bacteria. Similarly, bioremediation strategies 
may be evaluated using the compositions of the invention. 

in a further embodiment, the probes are used for forensic "DNA fingerprinting" to match crime-scene 
DNA against samples taken from victims and suspects. 

in an additional embodiment, the probes in an array are used for sequencing by hybridization. 

The present invention also finds use as a methodology for the detection of mutations or mismatches in 
target nucleic acid sequences. For example, recent focus has been on the analysis of the relationsh.p 
between genetic variation and phenotype by making use of polymorphic DNA markers. Prevous work 
utilized short tandem repeats (STRs) as polymorphic positional markers; however, recent focus ,s on 
the use of single nucleotide polymorphisms (SNPs). which occur at an average frequency of more 
than 1 per kilobase in human genomic DNA. Some SNPs. particularly those in and around cod,ng 
sequences, are likely to be the direct cause of therapeutical* relevant phenotypic variants. There are 
a number of well known pdymorphisms that cause clinically important phenotypes; for example, the 
apoE2/3/4 variants are associated with different relative risk of Alzheimer's and other diseases (see 
Cordor et a«., Science 261(1993). Multiplex PGR amplification of SNP loci with subsequent 
hybridization to oligonucleotide arrays has been shown to be an accurate and reliable mrtod o 
simultaneously genotyping at least hundreds of SNPs; see Wang et a... Science. 280. 077 (19 8), 
see also Schafer et al.. Nature Biotechnology 16:33-39 (1998). The compositions of the present 
invention may easily be substituted for the arrays of the prior art. 

.n a preferred embodiment, the compositions of the invention are used to screen bioactive agents to 
find an agent that wi.l bind. and preferably modify the function of, a target molecule. As above, 
variety of different assay formats may be run. as will be appreciated by those in the art. Genera ly. the 
target analyte for which a binding partner is desired is labeled; binding of the target anaryte by the 
bioactive agent results in the recruitment of the labe. to the bead, with subsequent detection. 
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,n a preferred embodiment, the binding of the bioactive agent and the target ana.yte is specific; that is 
the bioactive agent specify binds to the target anaiyte. By "specificaHy bind" herein . meant that 
the agent binds the anaiyte. with specificity sufficient to differentiate between the ana.yte and o her 
components or contaminants of the test sampie. However, as wil. be appreciated by those in the art. ,t 
wi „ be possible to detect anaiytes using binding which is not highly specific; for example, the systems 
may use different binding .igands, for example an array of different ligands, and detection of any 
particuiar anaiyte is via its "signature" of binding to a pane, of binding .igands. similar to the manner ,n 
which "electronic noses" work. This finds particuiar utility in the detection of chemical anaiytes. The 
binding should be sufficient to remain bound under the condKions of the assay, including wash steps 
to remove non-specific binding, aithough in some embodiments, wash steps are not desired. ..e. for 
detecting .ow affinity binding partner, In some embodiments, for example in the detection of certa-n 
biomo.ecu.es, the dissociation constants of the anaiyte to the binding ligand wii. be less thar .about 
10 .. 10 , M -, witn less than about 10* to 10* being preferred and .ess than about 10" -10" M" 
being particularly preferred. 

Genera,,,, a sampie containing a targe, ana*, (terror detedon of me targe, ana*, or 
screening » binding par*.* 0, me target «*■» is added to the arrav, under ~ suitab* for 
binding 0. the target anaMe t. at teas, one of tne b,oac«ve agents, i.e. genera,!, ***** 

ndL. The resenc.oreb.enoeoftn.u.rgetana^.istnende^d. As w, be appr.aa.ed b, 
mose in tne art this ma, be don. in a variety 0. wavs, genera,* .hrough the us. of a change ,n an 
op 6 ca, signal. This Chang, can occur « man, d«e,.n. mechanisms. 

b nding of a digged ana„1e to the bead, the predion of a d,e spades on or n.a, the bead,. me 
dye on bead, or any other optical interrogatable event. 

,„ a preferred embodiment change in W «ca, sfcna, occurs as a resu,. o, the birding o, a t^et 

such as a .ucrochrome. Thus, for e,am P ,e. when a proteinaceous targe, ana*.e ,s used. n», be 
either dire* ,abe,ed with a «uor, or indirect,,, fo, e*amp* through the use 0, afcbeled -** 
Sim»anv, nucleic acids are ees«, tabeied with .uoropho, for examp,e dunng PCR amp ~ . 

used as me PreferentJallv assocate w«hdoub* stnanded 

usual* reversibh, Hybhd^on Mo*, include intercalate* and minor and/oc major groove b, d*g 

moieties. In a preferred embodiment. motors ma, be used; since intercalation generatr, or* 

me fcbel light up. Thus, upon bindrng 0, me target anafvte to a bioacW agent, men, ,s a new opbca, 
signal generated at mat site, which then ma, be detected. 
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Alternatively, in some cases, as discussed above, the target analyte such as an enzyme generates a 
species that is either directly or indirectly optical detectable. 

Furthermore, in some embodiments, a change in the optica, signature may be the basis of the optical 
signal For example, the interaction of some chemical target analytes with some fluorescent dyes on 
the beads may alter the optical signature, thus generating a different optical s,gnal. 

As will be appreciated by those in the art. in some embodiments, the presence or absence of the 
target analyte may be done using changes in other optical or non-optical signals, including, but not 
,imited to. surface enhanced Raman spectroscopy, surface plasmon resonance, radioactivity, etc. 

Again as outlined above for decoding, the assay for the presence or absence of a target analyte 
utilizes sequential processing of data images using a computer system. Thus, in a preferred 
embodiment, a first data image of a random array is acquired using an acquisition module of the 
computer system. This initial date image may be decoded, i.e. the location of some or all of the 
bioactive agents may be known, or decoding may occur either during or after the assay. A registration 
module of the computer system is used to create a registered first data image, using either an 
exogeneous fiducial or a fiducial template generated by acquiring a template data image as outhned 
herein, for example by evening illuminating the array. The sample is then added to the array, and a 
second data image is acquired using the acquisition modu.e. The fiducial and registration module are 
then used to create a registered second data image. A companion module of the computer system . 
then used to compare the registered data images to determine the presence or absence of sa.d target 
analyte. 

The assays may be run under a variety of experimental conditions, as will be appreciated by those in 
the art A variety of other reagents may be included in the screening assays. These include reagents 
,ike salts, neutral proteins, e.g. albumin, detergents, etc which may be used to facilitate optima, 
protein-protein binding and/or reduce non-specific or background interactions. Also reagents that 
otherwise improve the efficiency of the assay, such as protease inhibitors, nuclease inhibitors 
anti-microbia. agents, etc.. may be used. The mixture of components may be added in any order that 
provides for the requisite binding. Various blocking and washing steps may be utilized as ,s known ,n 



the art. 



in a preferred embodiment, two-color competitive hybridization assays are run. These assays can be 
based on traditional sandwich assays. The beads contain a capture sequence located on one s,de 
(upstream or downstream) of the SNP. to capture the target sequence. Two SNP allele-specfic 
probes each labeled with a different fluorophor. are hybridized to the target sequence. The genotype 
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can be obtained from a ratio of the two signals, with the correct sequent generally exhibiting better 
binding This has an advantage in that the target sequence itseif need not be labeled. In add-on. 
since the probes are competing, this means that the conditions for binding need not be optima. 
Under conditions where a mismatched probe would be stably bound, a matched probe can st.ll 
displace it. Therefore the competitive assay can provide better discrimination under those condit,ons. 
Because many assays are carried out in parallel, conditions cannot be optimzed for every probe 
simultaneously. Therefore, a competitive assay system can be used to help compensate for non- 
optimal conditions for mismatch discrimination. 

,n a preferred embodiment, dideoxynucleotide chain-termination sequencing is done using the 
compositions of the invention. In this embodiment, a DNA polymerase is used to extend a pnmer 
using fluorescent* labeled ddNTPs or other chain terminating nucleotides. The 3' end of the primer ,s 
.ocated adjacent to the SNP site. In this way. the single base extension is complementary to the 
sequence at the SNP site. By using four different fluorophors. one for each base, the sequence of the 
SNP can be deduced by comparing the four base-specific signals. This may be done in several ways, 
in a f,rst embodiment, the capture pror* can be extended; in this approach, the probe must either be 
synthesized 5'-3' on the bead, or attached at the 5' end. to provide a free 9 end for polymerase 
extension. Alternatively, a sandwich type assay can be used; in this embodiment, the target ,s 
captured on the bead by a probe, then a primer is annealed and extended. Again, in the tatter case, 
the target sequence need not be labeled. In addition, since sandwich assays require two spectfc 
interactions, this provides increased stringency which is particularly helpful for the analysis of complex 
samples. 

,n addition, primer extension is possible; extension of a primer bound to template in .iquid phase is 
followed by capture of the extended primer on the array. 

,n addition, when the target analyte and the DBL both bind to the agent, it is also possible to do 
detection of non-labelled target analytes via competition of decoding. 

,n a preferred embodiment, the methods of the invention are useful in array quality control. Prior to 
this invention, no methods have been described that provide a positive test of the performance of 
every probe on every array. Decoding of the array not only provides this test, it also does so by 
making use of the data generated during the decoding process itself. Therefore, no add.tiona 
experimental work is required. The invention requires only a set of data analysis algorithms that can 
be encoded in software. 
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The quality control procedure can identify a wide variety of systematic and random problems in an 
array For example, random specks of dust or other contaminants might cause some sensors to g.ve 
an incorrect signal-this can be detected during decoding. The omission of one or more agents from 
multiple arrays can also be detected. An advantage of this quality control procedure is that ,t can be 
imp,emented immediate prior to the assay itself, and is a true functional test of each ,ndiv.dual 
sensor Therefore any problems that might occur between array assembly and actual use can be 
detected In applications where a very high level of confidence is required, and/or there is a significant 
chance of sensor failure during the experimental procedure, decoding and quality control can be 
conducted both before and after the actual sample analysis. 

,n a preferred embodiment, the arrays can be used to do reagent quality control. In many instances, 
biological macromo.ecu.es are used as reagents and must be qua.ity controlled. For example, large 
sets of o.igonuc.eotide probes may be provided as reagents, .t is typically difficult to perform quality 
control on large numbers of different biological macromolecu.es. The approach described here can be 
used to do this by treating the reagents (formulated as the DBLs) as variable instead of the arrays. 

in a preferred embodiment, the methods outlined herein are used in array calibration. For many 
applications, such as mRNA quantitation, it is desirable to have a signal that is a linear response to the 
concentration of the target ana.yte. or. alternative* . if non-linear, to determine a relationship between 
concentration and signal, so that the concentration of the target analyte can be estimated. 
Accordingly, the present invention provides methods of creating calibration curves in parallel for 
multiple beads in an array. The calibration curves can be created under condittons that simulate the 
complexity of the sample to be analyzed. Each curve can be constructed independently of the others 
(e g for a different range of concentrations), but at the same time as all the other cu,ves for the array. 
Thus in this embodiment, the sequential decoding scheme is implemented with different 
concentrations being used as the code "labels", rather than different fluoroses. In this way. s.gnal 
as a response to concentration can be measured for each bead. This calibration can be carried out 
just prior to array use. so that every probe on every array is individually calibrated as needed. 

in a preferred embodiment, the methods of the invention can be used in assay development as well. 
Thus for example, the methods allow the identification of good and baa probes; as is understood by 
those in the art. some probes do not function well because they do not hybrid* well, or because they 
cross-hybrid* wrth more than one sequence. These problems are easily detected during decrtng. 
The ability to rapidly assess probe performance has the potential to greatly reduce the time and 
expense of assay development. 
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Similarly, in a preferred embodiment, the methods of the invention are useful in quantitation in assay 
development Major challenges of many assays is the ability to detect differences in analyte 
concentrations between samples, the ability to quantitate these differences, and to measure absolute 
concentrations of analytes, all in the presence of a complex mixture of related analytes. An example 
of this problem is the quantitation of a specific mRNA in the presence of total cellular mRNA. One 
approach that has been developed as a basis of mRNA quantitation makes use of a multiple match 
and mismatch probe pairs (Lockhart et al., 1996), hereby incorporated by reference in its entirety. 
While this approach is simple, it requires relatively large numbers of probes. In this approach, a 
quantitative response to concentration is obtained by averaging the signals from a set of different 
probes to the gene or sequence of interest. This is necessary because only some porbes respond 
quantitatively, and it is not possible to predict these probes with certainty. In the absence of prior 
knowledge, only the average response of an appropriately chosen collection of probes is quantitative. 
However, in the present invention, that can be applied generally to nucleic acid based assays as well 
as other assays. In essence, the approach is to identify the probes that respond quantitatively in a 
particular assay, rather than average them with other probes. This is (tone using the array calibration 
scheme outlined above, in which concentration-based codes are used. Advantages of this approach 
include: fewer probes are-needed; the accuracy of the measurement is less dependent on the number 
of probes used; and that the response of the sensors is known with a high level of certainty, since 
each and every sequence can be tested in an efficient manner. It is important to note that probes that 
perform well are selected empiriically, which avoids the difficulties and uncertainties of predicting 
probe performance, particularly in complex sequence mixtures. In contrast, in experiments described 
to date with ordered arrays, relatively small numbers of sequences are checked by perfomring 
quantitative spiking experiments, in which a known mRNA is added to a mixture. 

All references cited herein are incorporated by reference in their entirety. 
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CLAIMS 

We claim: 

1. An array composition comprising: 

a) a substrate with a surface comprising discrete sites; and 

b) a population of microspheres comprising at least a first and a second subpopulation, 
wherein each subpopulation comprises a bioactive agent; and 

c) at least one fiducial; 

wherein said microspheres are distributed on said surface. 

2. An array composition according to claim 1 wherein each subpopulation comprises a unique optical 
signature. 

3. An array composition according to claim 1 wherein each subpopulation comprises an identifier 
binding ligand that will bind a decoder binding ligand such that the identification of the bioactive agent 
can be elucidated. 

4. An array composition according to claim 1 wherein said substrate is a fiber optic bundle and said 
fiducial is a fiducial fiber. 

5. An array composition according to claim 1 wherein said substrate is a fiber optic bundle, said array 
comprises at least three non-linear fiducials, and each of said fiducials is a fiducial fiber. 

6. An array composition according to claim 5 wherein at least one of said fiducial fibers has a different 
shape from the others. 

7. An array composition according to claim 1 wherein said fiducial is a defined edge of said substrate. 

8. An array composition according to claim 1 wherein said fiducial is a fiducial bead. 

9. An array composition according to claim 1 wherein said bioactive agents are nucleic acids. 

10. An array composition according to claim 1 wherein said bioactive agents are proteins. 

1 1. An array composition according to claim 1 further comprising a computer readable memory 
comprising: 

a) computer code that receives a first data image; and 
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b) computer code that registers said first data image using said fiducial to generate a first 
registered data image. 

12 . An array composition according to claim 1 1 wherein said computer readable memory further 
comprises: 

a) computer code that receives a second data image; 

b) computer code that registers said second data image using said fiducial to generate a 
second registered data image; and 

c) computer code that compares said first and said second data image. 

,3. A composition comprising a computer readable memory to direct a computer to function in a 
specified manner, said computer readable memory comprising: 

a) an acquisition module for receding a data image of a random array compnsmg a plural,* 

of discrete sites; 

b) a registration module for registering a data image; and 

c) a comparison module for comparing registered data images. 

14. A composition according to claim 13 wherein said random array comprises a fiber optic bundle 
and said registration module utilizes a fiducial fiber for registration. 

15. A composition according to Cairn 13 wherein said random array emprises microspheres and said 
registration module utilizes a fiducial microsphere for registration. 

16. A composition according to claim 13 wherein said registration module utilizes a fiducial template 
for registration. 

17. A composition according to claim 13 further comprising a random array comprising: 

a) a substrate with a surface comprising discrete sites; and 

b) a population of microspheres comprising at least a first and a second subpopulation. 
wherein each subpopulation comprises a bioactive agent; 

wherein said microspheres are distributed on said surface. 



1 B. A method of making an array composition comprising: 

a) forming a surface comprising individual sites on a substrate; 
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b) distributing microspheres on said surface such that said individual sites contain 
microspheres, wherein said microspheres comprise at least a first and a second 
subpopulations each comprising a bioactive agent; and 

c) incorporating at least one fiducial onto said surface. 

19. A method according to claim 18 wherein said subpopulations further comprise an identifier binding 
iigand that will bind a decoder binding ligand such that the identification of the bioactive agent can be 
elucidated. 

20. A method according to claim 18 wherein said subpopulations further comprise an optical signature 
such that the identification of the bioactive agent can be elucidated. 

21 . A method according to claim 18 wherein said substrate is a fiber optic bundle and said fiducial is a 
fiducial fiber. 

22. A method according to claim 18 wherein said substrate is a fiber optic bundle, said array 
comprises at least three non-linear fiducials, and each of said fiducials is a fiducial fiber. 

23. A method according to claim 122 wherein at least one of said fiducial fibers has a different shape 
from the others. 

24. A method according to claim 18 wherein said fiducial is a defined edge of said substrate. 

25. A method according to claim 18 wherein said fiducial is a fiducial bead. 

26. A method according to claim 18 wherein said bioactive agents are nucleic acids. 

27. A method according to claim 18 wherein said bioactive agents are proteins. 

28. A method for comparing separate data images of a random array comprising: 

a) using a computer system to register a first data image of said random array to produce a 
registered first data image; 

b) using said computer system to register a second data image of said random array to 
produce a registered second data image; and 

c) comparing said first and said second registered data image to determine any differences 
between them. 
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29. A method according to claim 28 wherein said random array comprises a f.oer optic bundle and the 
registration of said first data image utilizes a fiducial fiber. 

30. A method according to claim 28 wherein said random array comprises microspheres and the 
registration of said first data image utilizes a fiducial microsphere. 

31. A method according to Cairn 28 wherein the the registration of said first data image utilizes a 
fiducial template. 

32. A method of decoding a random array composition comprising 

a) providing a random array composition comprising: 

i) a substrate with a surface comprising discrete sites; and 

ii) a population of microspheres comprising at least a first and a second 
subpopulation. wherein each subpopulation comprises a bioactive agent; 

wherein said microspheres are distributed on said surface; 

b) adding a first plurality of decoding binding ligands to said array composition and creating 
a first data image; 

c) using a fiducial to generate a first registered data image; 

d) adding a second plurality of decoding binding ligands to said array composition and 
creating a second data image; 

e) using said fiducial to generate a second registered data image; and 

0 using a computer system to compare said first and said second registered data image to 
identify the location of at least two bioactive agents. 

33. A method according to Cairn 32 wherein said random array comprises a fiber optic bundle and the 
registration of said first data image utilizes a fiducial fiber. 

34. A method according to claim 32 wherein said random array comprises microspheres and the 
registration of said first data image utilizes a fiducial microsphere. 

35. A method according to claim 32 wherein the the registration of said first data image utilizes a 
fiducial template. 

36. A method according to claim 32 wherein said bioactive agents are proteins. 

37. A method according to claim 32 wherein said bioactive agents are nucleic acids. 
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38 A method of determining the presence of a target analyte in a sample comprising: 

a) acquiring a first data image of a random array composition compns-ng: 

i) a substrate with a surface comprising discrete sites; and 

ii) a population of microspheres comprising at least a first and a second 
subpopulation each comprising a bioactive agent; 

wherein said microspheres are distributed on said surface such that said discrete 
sites contain microspheres; 

b) registering said first data image to create a registered first data image; 

c) contacting said random array composition with said sample; 

d) acquiring a second data image from said array with said sample; 

e) registering said second data image to create a registered second data image; and 

0 comparing said first and said second registered data images to determine the presence or 
absence of said target analyte. 

39. A method according to claim 38 wherein said random array comprises a fiber optic bundle and the 
registration of said first data image utilizes a fiducial fiber. 

40. A method according to claim 38 wherein said random array comprises microspheres and the 
registration of said first data image utilizes a fiducial microsphere. 

41 a method according to claim 38 wherein the the registration of said first data image utJIizes a 
fiducial template. 

42. A method according to claim 38 wherein said bioactive agents are proteins. 

43. A method according to claim 38 wherein said bioactive agents are nucleic acids. 
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